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^ ABSTRACT 

^ ■ We present Mgll -based black hole mass estimates for 27,602 quasars with 
rest-frame UV spectra available in the Sloan Digital Sky Survey Data Release 

Q ■ Three. This estimation is possible due to the existence of an empirical correlation 

^ ■ between the radius of the broad line region and the continuum luminosity at 3000 

■ Angstroms. We regenerate this correlation by applying our measurement method 
to UV spectra of low-redshift quasars in the HST/IUE databases which have cor- 

O . responding reverberation mapping estimates of the H /3 broad line region's radius. 

^ ■ Our mass estimation method uses the line dispersion rather than the full width 
i at half maximum of the low- ionization Mg ll emission line. We measure Mg ll line 
I ■ dispersions for quasars whose spectra have been reconstructed using the most sig- 

■ nificant eigenspectra produced through Principal Component Analysis. We have 
00 ■ 

^ . tested the reliability of using reconstructed spectra in black hole mass estimation 

. using a Monte Carlo simulation and by comparing the results from original and 

• . reconstructed Data Release Three spectra. We show that using reconstructed 

O ' spectra not only makes bias-free mass estimation possible for quasars with low 

^ . spectroscopic signal-to-noise ratio, but also reduces the intrinsic scatter of the 

. distribution of the black hole masses to lower than 0.15 dex. 

X: 

^ ■ Subject headings: black hole physics - galaxies: active - galaxies: fundamental 

■ ■ ■ parameters - galaxies: high-redshift - quasars: general - surveys 



1. Introduction 

Over the past decade, we have learned that all large galaxies have supermassive black 
holes at their centers (e.g., Kormendy & Richstone et al. 1995; Richstone et al. 1998). 
Material falling onto black holes forms a rapidly spinning disk which heats up and radiates 
the light we see as quasars and other active galactic nuclei (AGN) (e.g., Salpeter 1964; 
Zel'dovich & Novikov 1964; Lynden Bell 1969). The masses of these black holes can be 
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measured through different techniques. In galaxies with a dormant black hole, the BH mass 
can be measured through the velocity dispersion of stars or gas close enough to the BH which 
their dynamics are dominated by BH gravity (within the radius of influence; e.g., Gebhardt 
ct al. 2000; Ferrarese & Merritt 2000; Tremaine et al. 2002). However, this method cannot 
be used for active nuclei where the nucleus outshines the core of the host galaxy (e.g., 
Richstone et al. 1998; Gebhardt et al. 2000; Ferrarese & Merritt 2000; Tremaine et al. 2002). 
Thus, alternative mass measurement methods are necessary, but are AGN type dependent 
(Blandford & McKee 1982; Peterson 1993; Miyoshi et al. 1995). In nearly edge-on (Type 
2) AGNs, mass can be measured from water masers located in the outer accretion disk 
(Miyoshi et al. 1995). However, water masers cannot be used widely due to the extreme 
edge-on alignment needed for radiation to be sufficiently amplified. In more face-on (Type 
1) AGNs, one can instead use reverberation mapping when variations of the broad emission 
lines relative to the continuum are present (e.g., Koratkar & Gaskell 1991; Kaspi et al. 2000; 
Peterson ct al. 2004; Kaspi et al. 2005; Bentz et al. 2006; Bentz 2009). 

Reverberation mapping (RM) studies (Blandford & McKee 1982; Peterson 1993) take 
advantage of long monitoring campaigns of AGNs to infer properties of the broad line emit- 
ting region (BLR) through characterization of time delays (r) between broad emission line 
and continuum flux variations (e.g., Koratkar & Gaskell 1991; Kaspi et al. 2000; Kaspi et 
al. 2005; Bentz et al. 2006; Bentz 2009). In principle, RM can generate a velocity-delay 
map (revealing kinematic structure in the BLR) which can help one to estimate the black 
hole mass independent of other methods (see Done & Krolik 1996; Ulrich & Horne 1996; 
KoUatschny 2003; Horne et al. 2004; Denney et al. 2009; Bentz et al. 2009). However, at 
the moment, one still has to calibrate quasar black hole masses to nearby well studied dor- 
mant black holes (assuming that they are descendants of the active BHs; see, e.g., Kaspi ct 
al. 2000; Krolik 2001; McLure & Dunlop 2001; Onken et al. 2004). It has taken many years 
of RM campaigns to obtain a large sample of such mass measurements even for a handful 
of low redshift quasars (e.g.. Done & Krolik 1996; Ulrich & Horne 1996; KoUatschny 2003; 
Peterson 2006; Bentz et al. 2009; Denney et al. 2009). The RM results show an empirical 
correlation between the distance of the BLR from the BH (which is the measured time-lag 
T times the speed of light) and a monochromatic continuum luminosity (known as R — L 
or T — L correlation; Kaspi et al. 2000; Bentz et al. 2006). This R — L correlation simpli- 
fies the mass measurement since determining the radius of the BLR using long monitoring 
campaigns can be replaced by measuring the monochromatic continuum luminosity from a 
single epoch spectrum. 

The RM results also show a relationship between emission line widths {AV) and cor- 
responding time-lags (e.g., Peterson & Wandel 1999, 2000; Onken & Peterson 2002; Kol- 
latschny 2002) which confirms the virialization of the BLR. This relationship enables a 
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secondary method of BH mass determination from ^ oc (AVY if one assumes that the AV 
of a viriahzed emission hne (e.g., H /3) reflects the viriahzed bulk motion of the emitting gas 
(e.g., Peterson & Wandel 1999, 2000; Onken & Peterson 2002; KoUatschny 2003). By mea- 
suring the line width of a particular line (using the FWHM or the line dispersion) and the 
luminosity of the AGN at a particular wavelength, one can calibrate the virial mass equation 
of the secondary method for a sample of RM black hole masses. That equation can then be 
used to estimate unknown quasar BH masses (e.g., Wandel et al. 1999; Vestergaard 2002; 
McLure & Jarvis 2002; Woo & Urry 2002; Vestergaard & Peterson 2006; Shen et al. 2008; 
Wang et al. 2009). 

Much work on distant black hole mass estimates has used the hydrogen H fS emission 
line and the 1549 A doublet of triply ionized carbon (Civ); e.g., Kaspi et al. (2000), Vester- 
gaard (2002), McLure & Jarvis (2002) and Vestergaard & Peterson (2006). can be 
identified in low redshift quasars {z < 0.8) with observations from ground-based telescopes 
in the optical or near infrared, while the C IV emission line can be identified in high redshift 
quasars {z > 2) in those bands. Thus, H/3 provides well calibrated masses but cannot be 
seen in the most distant black holes with ground based telescopes, while CiV can be seen 
but could be strongly contaminated by non-gravitational motions. For example, the peak of 
the Civ emission line is usually blueshifted, probably implying an outflow of material (e.g., 
Leighly et al. 2000; Yuan et al. 2004; Vestergaard 2009). 

We have assembled a catalogue of black hole mass estimates using the 2798 A doublet 
of singly ionized magnesium (Mgll), which can be observed by ground-based telescopes in 
the optical or near infrared for even the most distant quasars and which is thought to probe 
largely the same region of the accretion disk as H /3 (McLure & Jarvis 2002; but see Wang et 
al. 2009). Although there is no RM study which demonstrates the viriahzation of the Mgll 
emission line, BH mass estimates using Mg ll show consistency with similar estimates using 
H^ (McLure & Jarvis 2002), which is considered a viriahzed hne (Peterson & Wandel 1999). 

For our mass estimates, we use the hne dispersion of the line profile instead of FWHM 
since the relation between rms line dispersion and RM time-delay appears to be less scattered 
than the same relation with rms FWHM, at least for the H/3 line (Peterson et al. 2004). 
The line dispersion uses the data directly but is sensitive to the wings of the line profile 
(Colhn et al. 2006). The FWHM is the FWHM of a fit of one or more Gaussians to the hne 
profile (e.g, McLure & Dunlop 2004; Shen et al. 2008), and the FWHM of a single Gaussian 
fit in particular may be a poor representation of the true FWHM of a line. Furthermore, it 
is not confirmed that the Mg ll FWHM is linearly related to the rms FWHM of H /5 (Wang 
et al. 2009). Shen et al. (2008) have provided a BH mass catalogue with mass estimates 
made using the FWHMs of Mg ll, H /3 and C IV. However, they have not cahbrated the Mg ll 
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scaling relation but instead have adopted the McLure & Dunlop (2004) relationship, despite 
there being many updated RM mass measurements since 2004. Onken & Kollmeier (2008) 
have shown that the Shen et al. (2008) Mgll mass estimates are Eddington ratio dependent, 
which may indicate a problem with FWHM-based Mgll mass estimates. We are therefore 
motivated to test the alternative, the line dispersion of Mg ll. 

To reduce the uncertainties in the line dispersion due to low signal-to-noise ratio spectra 
and the sensitivity of the technique to the line wings, we have reconstructed the quasar spec- 
tra using the Karhunen-Locve transformation technique (also called Principal Components 
Analysis; PCA) e.g., Boroson and Green (1992); Connolly et al. (1995); Yip et al. (2004); 
Boroson et al. (2010). Reconstructing the quasar spectra reduces the noise sufficiently for 
us to use the line dispersion robustly. 

The outline of this paper is as follows. In §2, we describe how we measure the second 
moment of the Mg II line profile, including how we model Fe ll emission lines and establish a 
local continuum. In §3, we cahbrate a mass scaling relation using the Mgll hne dispersion 
and the monochromatic continuum luminosity at 3000A. In §4, we describe how we select our 
data sample from the SDSS. In §5, we explain how we reduce the noise level by reconstructing 
the SDSS quasar spectra using PCA. In §6, we study the effect of the radiation pressure on 
BH mass estimates and calibrate a new mass scaling relation for that scenario. In §7, we 
simulate the effects of noise in reconstructed spectra to understand the biases on BH mass 
estimates before and after reconstruction. In §8, we use the mass scaling relations to estimate 
the BH masses of SDSS quasars and present our BH mass catalogue. We then compare our 
BH mass estimates with Shen et al. (2008). In §9, we present our conclusions. 

In this study we assume a Lambda Cold Dark Matter {KCDM) cosmology with Hubble 
constant Hq = 71 km s^-*^ Mpc^^, fractional dark energy density Qj^ = 0.74 and fractional 
matter energy density Qrn = 0.26 (Spergel et al. 2007) to calculate necessary cosmological 
quantities. 



2. Black Hole Mass Estimates via SecondEiry Method 

To compute A^gfj from the virial equation 
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it is necessary to estimate the radius of the broad line region, Rblr, via a correlation with 
the AGN continuum luminosity when there is no reverberation mapping data available. The 
observed line width, AV, and the continuum luminosity can both be estimated from a single 
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epoch spectrum. The average geometrical factor / will remain as a free parameter to be 
adjusted such that it makes these secondary mass estimates most consistent with the virial 
mass estimates. The virial masses are in turn calibrated to direct mass measurements of 
nearby inactive black holes through the BH mass - bulge velocity dispersion (A^gpj — o"*) 
relationship (e.g., Gebhardt et al. 2000; Ferrarese & Merritt 2000; Tremaine et al. 2002). 
This calibration is valuable since there is no direct way to estimate the mass zero point for 
the quasar BH mass relationship, but it does assume that the same relationship is valid for 
quasars and for inactive BHs. 



2.1. Measuring the Line Width of an Emission Line 

The traditional way to measure the line width is to measure its FWHM. This is simple 
for single peaked lines and more complex for multiple-peaked lines or noisy data (for double- 
peaked hnes, there is a complete discussion in Peterson et al. 2004). The hue dispersion, 
which can be calculated without making any assumption about the line profile, can be defined 
by: 

^Le(A) =< > -Xl (2) 
where < > and Aq are the 2'^'' & 1** moments of the emission hne profile -P(A): 

f\'P{\)dX 
<^ JPi\)d\ 
_ JXP{X)dX 

^° " JPiXjdX' ^ ^ 

The second moment is more sensitive to blending with other lines or extended wings than 
the FWHM, but less sensitive to the presence of even strong narrow-line components. The 
FWHM versus time lag plots for emission lines show dramatic scatter (especially for H (3) 
when using the mean spectra (which are more similar to single epoch spectra than rms 
spectra are), while the hne dispersion, cr^ne; versus time lag plots show less scatter (Peterson 
et al. 2004). 

On the other hand, studies of RM samples have shown that AGNs with line dispersion, 
aiine, lower than 2000 km s~^ have FWHM/cTH„e < 2.35 while AGNs with aune > 2000 km s"^ 
have FWRM/ aiine > 2.35 (Sulentic et al. 2000; Collin et al. 2006; Peterson et al. 2007). In 
particular, Peterson et al. (2007) report that the observed value of FWHM/uHne for a sample 
of RM AGN ranges between ~ 0.71 and ~ 3.45 with an average value of ~ 2.0. This result 
clearly indicates the non-Gaussianity of line profiles for quasar emission lines. Thus using the 
FWHM as a line width indicator may overestimate the BH masses for the broadest emission 
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lines and underestimate the BH masses for the narrowest emission hnes as compared to BH 
mass estimates using the hne dispersion. 

The true hne dispersion can be determined through considering the instrumental reso- 
lution and writing the observed line width in terms of the intrinsic line dispersion, aintr and 
the spectrograph's instrumental line profile, ainst' 

'^Le = CTintr + (1 + ^)~'^'^inst (5) 

where the instrumental dispersion is observed-frame while the other two are rest-frame. The 
instrumental resolution of an SDSS spectrum is a function of plate, fiber, and wavelength. 
The wavelength dispersion in pixels as a function of wavelength for each fiber on a plate 
is stored in HDU ^^4 of the spPlate FITS files. For our objects, we need to know the in- 
strumental dispersion at the observed wavelength of Mgll. For each quasar, the observed 
wavelength of Mgll is calculated as Aq = 2800.26(1 + z). Using the spPlate file, the corre- 
sponding pixel in the quasar's SDSS spectrum is located and the wavelength dispersion d 
(in units of pixels) at that pixel is extracted. The pixel scale is 10~^ln(10)Ao A/pixel, so the 
instrumental dispersion in A is given by 2.302585 x lO^^Aorf. 



2.2. Fitting Method 

The rest-frame ultraviolet spectrum of an AGN can be approximated as a power-law 
continuum plus broad and overlapping Fe ll emission and broad emission lines from other ions 
such as Mg II. To measure the Mg ll hne, we must remove the contaminating Fe ll emission 
from AGN spectra. We subtract a best- fit Fe ll template and power law continuum estimate, 
which together form what is called a pseudo-continuum. 

To match the Fe ll hnes to the AGN spectra, we build an Fe ll model by convolving the 
Fell template generated from the nearby AGN I Zw 1 by Vestergaard & Wilkes (2001) with 
a Gaussian. The AGN spectra have lower resolution than the Fe ll template so we rebin the 
Fell template to match the AGN spectra in their rest frames. Vestergaard & Wilkes (2001) 
has been the standard empirical Fell template since its publication. Tsuzuki et al. (2006) 
present a new template, also derived from I Zw 1, which unlike that of Vestergaard & 
Wilkes (2001) shows some Fell emission at wavelengths 2780-2920 A where confusion with 
Mgll is an issue in constructing a template. While some such Fell emission is certainly 
present, Tsuzuki et al. (2006) derive their Fell template by assuming Gaussian line profiles 
for Mgll in I Zw 1. What they interpret as Fell emission at 2790 A could instead be a weak 
blue wing of Mgll emission. Furthermore, neither the theoretical Fell spectra considered 
by Tsuzuki et al. (2006) nor those presented by Bruhweiler & Verner (2008) show strong 
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evidence for emission at wavelengths corresponding to features at 2780-2920 A in the Tsuzuki 
et al. (2006) template. More work on Fell templates is clearly needed, but meanwhile we 
will use the Vestergaard & Wilkes (2001) template. 

The normalization and slope of the power law continuum are initially estimated using 
two normahzation windows, 2238 — 2248 A and 3014 — 3027 A, which are the least contam- 
inated from Fell or other hues (Vestegaard and Wilkes 2001). 

To allow improvement in the fitting quality we scale the Fell template independently 
above and below 2800A. This additional amplitude will be accepted if the new is lower 
than the 95% confidence lower limit on the old with only 4 free parameters. The best 
pseudo-continuum is then subtracted from the AGN spectra. 

We vary the amplitude and slope of the power law continuum, the amplitude of the Fe ll 
template above and below 2800A, and the width of the Gaussian convolved with the Fell 
template to find the best fit. Using a nonlinear Levenberg-Marquardt least-squares fitting 
method, we calculate the minimum value for the three fitting windows 2192 — 2400 A, 
2445 - 2724 A, and 2867 - 3027 A chosen to avoid the [Neiv] A2423 A and Mgll A2798 A 
emission lines during the Fe ll and continuum estimation. 

3. Determining the Mass Scaling Relationship 

To determine a mass scaling relationship which uses (TMgii and AL3000, we first need to 
investigate the existence of a relationship between AL3000 and Rblr- If that relationship ex- 
ists then we can scale our mass relationship to an existing sample of BH masses to determine 
the geometrical factor. 

The most direct measurements of the central black hole masses of powerful AGN come 
from the reverberation mapping studies of 17 Seyferts and 17 PG quasars by Wandel, Peter- 
son & Malkan (1999) and Kaspi et al. (2000), respectively. These studies arc based on the 
time lag measurement between the variation of the AGN continuum and the broad emission 
line H/3 of local AGN (low redshift objects with z < 0.4). We thus use this sample as our 
reference for the scaling process. 

We select 21 objects from this sample (Table 1 of McLure & Jarvis 2002) for which the 
Mg II FWHM has been calculated. We also have FWHM and H /3 line dispersion measure- 
ments for the same objects from Peterson et al. (2004). 

This list of AGN consists of 5 high resolution spectra from the Hubble Space Telescope 
(HST) Faint Object Spectrograph (FOS) and 16 low resolution spectra from the International 
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Ultraviolet Explorer (lUE) Long- Wavelength Prime (LWP) spectrograph (see Tabled]). For 
most of them there is more than one observation. In order to increase the signal-to-noise 
ratio and sometimes to increase the wavelength coverage, we combine different observations 
of each object by using a weighted average method. We adopt as the time-lag to the Mgll 
BLR the weighted average time-lag estimated using Ha, H/3 and H7, or whichever subset 
of those three are available, from Peterson et al. (2004). While these lines sometimes have 
different timelags, in this case the small number of Ha and H7 measurements available means 
that the resulting correlation is not significantly different if only the H/3 timelags are used. 

By using our fitting procedure, we find the best-fit broadened Fell template and the 
best power law continuum for each spectrum in Table [1] to create the corresponding pseudo- 
continuum. We subtract this pseudo-continuum from the spectrum so that we measure the 
line dispersion of the Mgll from the isolated emission line. The line dispersion for Mgll, 
AL3000, and the virial mass estimate can be seen in Table [1] column 3, 4, and 8 respectively. 

In a second approach, we use PC A- reconstructed spectra instead of raw spectra (see 
§[S]for details). However, using reconstructed spectra with high SNR does not improve the 
r — AL3000 correlation significantly. 

We estimate the slope and intercept of the linear correlation between the logarithms 
of tblr and i = ALaooo/lO^^ ergs~^ using several fitting techniques. However, we elimi- 
nate some of the objects from our list based on a large relative error of the average time- 
lag, ATa^erage/Taverage > 31.7%. Thcsc objccts are 3C390.3, PG0844 + 349, PG1229 + 204, 
NGC4051 and 3C120. 

Both the FITEXY (Tremaine et al. 2002; Press et al. 1992) and BCES methods (Akritas 
et al. 1996) gave a slope consistent with 0.5 within ^ la, with 20% uncertainty in the best 
case (see Table [2]). Given those results, and because a slope of 0.5 matches the theoretical 
slope suggested by Netzer et al. (1993) for a fixed ionization parameter at the innermost 
radius of the BLR and the observational slopes found for H/3 (Bentz et al. 2006) and Civ 
(Kaspi et al. 2007), we chose to adopt a fixed slope of 0.5 and recalculate the intercept from 
the data. In Figured] we show the tblr — ^ correlation calculated by MCMC0 along with the 
confidence regions for the correlation (Haario et al. 2006). The correlation that we adopt is: 



log{TBLR) = 0.5 log(£) + (1.469 ± 0.090), 



(6) 



)0.5 



[A Laooo/Werg s'^]. 



(7) 



-'^ Markov Chain Monte-Carlo Simulation 
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for ^ 1 with an intrinsic scatter of ~ 35%. Some of this intrinsic scatter arises because the 
luminosity measurements are not necessarily co-temporal with the time-lag measurements. 
However, quasar variability on timescales of 100 or more rest-frame days can be roughly 
described by a Gaussian with a^n = 0.2 magnitudes (Figure 12 of Vanden Berk et al. 2004). 
The corresponding additional scatter in the tblr-^ relationship should therefore be roughly 
described by a Gaussian with a = 0.04 dex, or about 10%. 

McLure and Jarvis (2002) estimated a correlation for Rblr with luminosity at 3000A 
by using the reverberation mapping data of 34 quasars from Wandel et al. (1999) and Kaspi 
et al. (2000) and found Rblr = (25.2 ± 3.0)£°-^^±° °5 where Rblr is in units of light days. 
Both of our correlations are well defined only for quasars with 0.1 < z < 0.5 within the 
luminosity range 0.001 < i < 100. However, we assume that our correlation will hold for 
objects with luminosity and redshift beyond those limits (as tentatively found for Civ by 
Kaspi et al. 2007, but see appendix Al of McLure & Dunlop 2004 for a contrary result). 

We can determine the average geometrical factor, /, in Equation 1 through comparing 
our virial mass estimates with the reverberation mass estimates by Peterson et al. (2004), 
which are calibrated to the A^bh — cr* correlation as described in Onken et al. (2004) @ We 
assume a linear fit with a fixed zero intercept for the virial product, VMbh = RBLR^yfntrl^^ 
versus reverberation black hole mass, RMbh- (A fit where the intercept is allowed to vary 
yields an intercept consistent with zero at ^ la). The reduced chi squared, is calculated 
from: 

^' [{VMbh).] -{RMbh] 



X. 



1 ^ r/T//i,f ^ f (u^j \ 12 

2 ^ \ ^ 



where the geometrical factor - which is the slope of the linear fit - is / = 5.3 ± 0.6 with 
— 1.92 (A^ = 15) considering an intrinsic scatter of eo — 35% for the estimated masses 
from both methods. The relationship between mass, luminosity and linewidth is: 

■^Bh/-^0 = 5.75/£°-VLr ± (statistical) 

±24% (systematic) 
±35% (intrinsic scatter) (9) 

where the constant 5.75 is the conversion factor for using BH masses in solar units when 
^ = [ALsooo/lO^^] is in units of erg s~^, and the intrinsic line dispersion of the Mgll emission 



^Note that / is not exactly the same as /, the geometrical factor used to scale reverberation-mapping 
virial products to the Mbh — relationship. We use / to scale single-epoch virial products to calibrated 
RM masses which already incorporate /. 
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line, aintr, is in units of km s ^. The statistical error of the black hole mass estimates, 
"^•^BH/"^©' '^^^ calculated from: 

^A^BH/-^0 = 2.875[^(a,VL. + IQ^.j'T'- (10) 

where (Jai„tr '^t- estimated errors from our fitting process. The scatter of 35% is 
equivalent to 0.15 dex. 

The systematic error of 24% (equivalent to 0.10 dex) comes from propagating the relative 
errors of the geometrical factor / and of the coefficient in the t^lr ~ C- relation (Equation 
|6]). It does not include any uncertainty in the exponent of the tblr — C- relation. 



4. The Sample 

In this study we use using spectra from the SDSS Data Release Three (DR3; Abazajian 
et al. 2005). The SDSS wavelength coverage sets the upper and lower limits for our sample 
selection. A spectrum needs to have the rest-frame wavelength of 2192-3027 A within the 
SDSS range, since we fit that region around the Mgll emission line (2798 A) with a power 
law continuum and Fell emission template. 

The above condition defines our acceptable redshift range, which is 0.7336 < zsdss < 
2.0393. However, the redshift given in the DR3 quasar catalog (Schneider et al. 2005), as 
described in §4.10.2.1 of Stoughton et al. (2002), is not always as accurate as we need to 
place the Mgll emission line sufficiently close to 2800 A in the adopted rest frame. Thus, 
we cross correlate the median quasar composite spectrum (Vanden Berk et al. 2001) with 
SDSS DR3 spectra (using the IRAfI task "xcsao") to redefine the object's redshift. These 
corrected redshift will be reliable only when relative change of the redshifts is smaller than 
2%. Consequently the new redshift range will come from | {znew — zsdss)\/{^ + ^sdss) — 2%. 
The top panel in Figure [2] shows the relative change in redshifts versus SDSS DR3 redshift, 
color-coded with the redshift confidence from the SDSS DR3 (Stoughton et al. 2002). Indeed, 
redshifts with lower confidence in SDSS DR3 tend to have larger redshift corrections. The 
bottom panel in Figure [2] shows the distribution of both redshift estimates. 

These new redshift limits, 0.6996 < Znew < 2.1013, however, include those objects where 
after redshift correction they have been shifted to the acceptable wavelength range (or maybe 
shifted out of the range), where the acceptable range is still defined by the normalization 
region. 



•^The Image Reduction and Analysis Facility, 



http://iraf.noao.edu/ 
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The luminosities of our calibration objects come from total magnitudes, but SDSS DR3 
spectra are scaled only to the flux in a 3" diameter aperture. The average correction factor 
we apply to the flux measured from SDSS DR3 spectra to produce the total flux of all our 
objects is —0.35 magnitudes, which is appropriate for unresolved objects in the SDSS DR3 
(see section 4.1.1 of Adelman-McCarthy et al. 2008). While this is an average correction, 
it is appropriate for quasars at all redshifts and luminosities in our sample {z > 0.7 and 
Mi < —24), as the fraction of SDSS quasars with extended morphologies at those luminosities 
is at most 2% (Figure 10 of Vanden Berk et al. 2006). 

Using our new quasar redshifts, the percentage of the successful fits in DR3 sample is 
about 79% of the initially selected sample. To increase the fraction of acceptable fits and 
to reduce the noise we later use the eigenspectra method to reconstruct the quasar spectra 
(see section [SJ. This increases the percentage of acceptable fits to 94%. The remaining 6% 
have problems such as a lack of data exactly at the Mg ll region, or a very broad absorption 
line very close to Mg ll or the normalization regions. In the end, our DR3 sample consists of 
93.3% of all quasars with 0.7336 < zsdss < 2.0393 (27602 quasars out of 29582). 

Note that the objects in our catalogue are not homogeneously selected. However, of 
the 9325 quasars in redshift range in the homogeneous subsample of Richards et al. (2006), 
our DR3 sample contains 8814 (94.5%). Quasars in that complete subsample are flagged in 
the DR3 mass catalogue, and only such quasars should be used for studies that require a 
volume-limited, homogeneously selected sample. However, even in such studies the effects 
of Malmquist bias must be taken into account (see §8.2p . 

5. Reconstructing quasar spectra from quasar eigenspectra 

Principal Component Analysis (PCA), sometimes called the Karhunen-Loeve transfor- 
mation, is an orthogonal linear transformation of data to a new coordinate system. As 
a result of this transformation the greatest variance by any projection of the data forms 
the first coordinate (called the first principal component or first eigenspectrum), the second 
greatest variance forms the second coordinate, and so on. Thus the great advantages of using 
PCA is that a number of possibly correlated variables transform into a smaller number of 
orthogonal variables called principal components. 

Yip et al. (2004) have applied this transformation to a sample of 16707 quasar spectra 
from the SDSS DRl to generate such principal components. They have performed the 
transformation on subsamples with different redshift {z) and absolute magnitude [Mi) in 
the i filter to generate eigenspectra for each (Mj, z) bin. Using this technique they separate 
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any possible luminosity effect on the spectra from any evolutionary effects. In each {Mi, z) 
bin, the flux values from all quasars as a function of rest-frame wavelength are the data 
points on which the PCA is run. Therefore, PCA generates eigenspectra which span the 
same rest-frame wavelength range as the input spectra. Yip et al. (2004) show that the first 
four eigenspectra in each (Mj, z) bin account for 82% of the total sample variance. These four 
eigenspectra represent the mean spectrum, a host-galaxy component, UV-optical continuum 
slope variations, and the correlations of Balmer emission lines respectively. We used all 50 
eigenspectra calculated by Yip et al. (2004) in the appropriate {Mi,z) bin to provide the 
most accurate reconstruction of our spectra possible. We considered using only the number 
of eigenspectra needed to reach xl = ^ for each object, but such an approach is not possible 
for all objects with the Yip et al. (2004) eigenspectra because Figure [5] shows that most of 
our objects would require more than 50 eigenspectra to reach = 1. 

We have used the 50 most significant eigenspectra from Yip et al. (2004) (the luminosity, 
redshift dependent eigenspectra) to reproduce the spectrum of each quasar. The reproduced 
spectra have the same continua as the original spectra, and the most significant features 
like emission lines are the same. However, the reconstructed spectra have less noise as 
compared to the original spectra. Any narrow absorption lines are removed through this 
method, and narrow regions of missing data are effectively reconstituted by the best-fit 
reconstruction found for the rest of the spectrum. In Figure [3] we show an example of an 
SDSS quasar spectrum before and after PCA reconstruction. The top panel shows the raw 
spectrum and our best pseudo-continuum estimate for it, and the middle panel the same for 
the reconstructed spectrum. The bottom panel shows the decontaminated spectrum which 
we use to measure the Mgll line dispersion. 

In Figure [5] we show the distribution of the reduced calculated for DR3 catalogue by 
comparing reconstructed spectra and raw spectra for two cases: first, using a window around 
Mgll between 2700A and 2900A and second, using the entire fitting region of 2192-3027A. 
For both cases, the peak is located near unity, indicating that the PCA reconstructions are 
statistically acceptable fits to the original spectra. 

6. Radiation Pressure Correction (RPC) on Scaling Relationship 

The virial theorem can be used for a dynamically relaxed gravitationally bound system. 
However, very luminous sources can violate this assumption. In such sources, the gravita- 
tional force of a BH can be reduced by the effect of the radiation pressure (Marconi et al. 
2008). If BLR clouds are attracted by a smaller effective force of gravity, their line disper- 
sion decreases, resulting in mistakenly underestimated virial BH mass estimates. Therefore, 
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the scaling relationships may therefore need to be recast to include the effect of radiation 
pressure. 

We have applied the Marconi et al. (2008) suggestion to correct the scaling relation- 
ship for the Mgll emission line such that A^gjj/A^0 = 5.75 fai'^'^crfntr + 9^ where i = 
[ALsooo/lO^^jerg s~^and amtr is the line dispersion of the Mgll line in units of km s~^. We 
use the same sample as in §[3] to determine the adjusted calibration factors, fa and g, through 
comparison of our virial mass estimates adjusted for radiation pressure with the black hole 
mass estimates made directly from the black hole mass - bulge velocity dispersion 

relationship for quiescent galaxies (Equation 1 in Tremaine et al. 2002). We assume that 
the relationship holds for AGNs and their host galaxies too. The velocity dispersions a* are 
extracted for 11 objects from Onken et al. (2004) and for two objects, PG1229-I-204 and 
PG2 130+099, from Dasyra et al. (2007). We assume a non-linear fit with a variable intrinsic 



scatter for the Adjusted Virial Mass (AVMbh) versus M%^. The reduced chi squared, x^i 
is calculated from: 



)i 



T\T n / ^ _2 I _2 I ,2 1-'--'-/ 



Where the adjusted calibration factors are fa = 2.4±1.5 and \ogg = 6.9±0.5 with xl — 1-00 
(A^ = 12; after excluding 3C390.3) considering an intrinsic scatter of cq — 15%. The adjusted 
mass scaling relation using Mgll emission line is: 

Mji^/M,j = 5.75 fai''al,,. + gi 
^^M^^/Mq (statistical) 
±15%(intrinsic scatter). (12) 

The statistical error of the black hole mass estimates, <^J\ylj^^/J\ylQy can be calculated from: 

^A^BHZ-^o = [af (2.875^ + gf + 132.2<_aL J.^O]"'^ (13) 

where (Ta,„tr ^'^d ae are estimated errors from our fitting process. The scatter of 15% corre- 
sponds to less than 0.07 dex of intrinsic scatter. We do not quote a systematic error (which 
comes from the systematic errors on fa and g) because its magnitude depends on the rela- 
tive values of the two terms in Equation [T21 which will be different for objects of different 
luminosities and linewidths. 

When the estimated line dispersion of Mgll is very small then limcr_^o -^bh/-^0 ~ 9^- 
Using our luminosity limits we have a lower limit on black hole mass estimates for the 

adjusted method of log( _y^^ ) ^ 6.9 for AL3000 > 10*^^ erg s~^. Whether this is a real 
constraint remains an open question. 
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7. Noise Simulation 

We now evaluate the statistical errors on BH mass estimates from reconstructed spectra 
due to the presence of different noise levels in quasar spectra before reconstruction. We 
select 100 quasars with the highest signal to noise ratio (SNR), from 20 to 50. Then we 
add randomly generated Gaussian noise to these spectra to decrease the SNR to 1/2, 1/4, 
1/8, and 1/16 of the original SNR (see Figure H]). We repeat this process for each new SNR 
6 more times to have at least 25 realizations for every spectrum, yielding 2500 spectra in 
total. We extract quantities like A -L3000 and <7Mgii (see section 12.11 and 12. 2p and for four 
different cases; before or after reconstructing the spectra by applying PGA and before or 
after applying the radiation pressure correction (RPG). 

Figure [6] shows, as a function of the average SNR in the / band, the differences in the 
logarithms of the black hole mass estimated after adding noise to spectra (MBH) and the 
mass estimated using original spectra with high SNR [OMBH) for four scenarios. As we can 
see from these Figures, the lower the SNR, the more the BH mass is uncertain with respect to 
the high SNR mass estimate. Gomparing Figure [6^ with Figure |6]d shows an improvement in 
removing the systematic underestimation of BH masses when the spectra are reconstructed 
using PGA. Gomparing Figure |6^ with Figure [6t shows a cutoff on the lower side of the 
mass estimates in Figure IHt. This lower limit is due to the asymptote approached when the 
estimated line dispersion is very small (limo-^o -^bh/-^Q ~ 3^) scenario when the 

RPG is applied. There were also three objects out of 100 selected with high SNR for which 
the results are catastrophic. One object is not being reconstructed properly due to an error 
on the redshift estimate, and two objects have a broad absorption line right in the Mgll 
window. Setting aside the 3% catastrophic errors, the scatter is normally distributed around 
zero when PGA reconstruction is applied (see, e.g.. Figure andlHli). 

In Figure [6^1, if there was no SNR dependency then we would expect a Gaussian distri- 
bution around zero. The negative trend in mass difference in low SNR level shows the high 
sensitivity of the method to the SNR. In Figure Et, the non-symmetrical trend in low SNR 
emphasizes the SNR dependency too. This implies that using second moment of the line 
profile to estimate the without PGA reconstruction is unbiased only for spectra with 
the very high SNR [SNR > 30) while after PGA reconstruction there is no such systematic 
bias with the SNR, making the PGA reconstruction a necessary step in precise estimation 
of BH mass algorithm. 

Gomparing the mass estimates from these different cases we can conclude that: using 
PGA improves the mass estimation scatter to better than 0.4 dex for SNR as low as 1.5 or 
to better than 0.2 dex for SNR of 18 (see Figures [7^1 andlTb)- This improvement is due to 
the sensitivity of the Fell modeling process and the second moment measurement to both 
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the SNR and the accuracy of the power-law estimates for quasar spectra. Using PCA along 
with considering the effect of the radiation pressure improves the mass estimation scatter 
to better than 0.2 dex for SNR of around 1.5 (see Figures [Tt and [711). Overall, BH mass 
estimates are around 50% less scattered for either high or low SNR if we consider the effect 
of the radiation pressure (compare Figure [6]d to[6li). However, BH masses are around 3 to 
4 times less scattered in the high SNR regime than in the low SNR regime, either before 
(Figure [6b) or after the radiation correction (Figure M^), but only if we reconstruct the 
spectra. 

Denney et al. (2009) have studied the effect of the SNR on reverberation BH mass 
estimates using both FWHM and line dispersion of quasar NGC 5548. They have reported 
a similar dependency on SNR as we see before reconstructing the spectra. 

8. Catalogue and Results 

Here we present a catalogue including black hole mass estimates for three scenarios: 
black hole masses before applying PCA and before RFC {MBHbb; see, e.g.. Figure [8^), 
BH masses after applying PCA but before RFC {MBHab, see, e.g.. Figures [8b), and BH 
masses after applying PCA and after RFC {MBHaa] see, e.g., Figures[Ht). We present several 
quantities in this catalogue including the augii and A L3000 (see Table [IJfor more information 
about the catalogue format). In Figures [8] and [9] we present plots of different quantities 
corresponding to these three different scenarios. Comparing panel (b) against panel (a) of 
these Figures represent the advantages of using PCA reconstruction while comparing panel 
(c) against panel (b) reveals the RFC. 

The artificially low black hole mass estimates (especially in the high redshift regime) in 
Figure [8^ are mostly removed after PCA reconstruction in Figure [8)d and [8I3. The masses 
were underestimated due to the decreased velocity range over which aune could be reliably 
measured in low SNR spectra. Forcing aune to be measured over a fixed velocity range re- 
gardless of SNR might reduce the mass underestimation in low-SNR unreconstructed spectra, 
but only at the cost of much larger scatter on aune- Figure [8h> shows that very massive black 
holes (> lO^'^M©; the red points in the color version) were active as quasars at z > 1.5. 
However, a more reasonable true upper mass limit in our redshift range, considering the 
errors which broaden the mass distribution, is perhaps lO^'^M©. 

For first scientific uses of this catalogue, see the BH spin evolution study of Rafiee & 
Hall (2009) and the sub-Eddington boundary study of Rafiee & Hall (2011). 
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8.1. Comparing with Shen et al. 2008 

Shen et al. (2008) have estimated black hole masses using the FWHMs of the Mgll, H/3 
and C IV emission lines for objects in the SDSS DR5 quasar sample. We have extracted the 
FWHM Mgll BH masses, Lboi, and the FWHM of the same objects in our DR3 catalogue 
from Shen at al. (2008). 

Figure [10] (a and b) shows the relationship between our bolometric luminosity and that 
of Shen et al. (2008) for the same objects. There is a 0.1 dex scatter on the distribution of 
the log of the ratio of AL3000 measurements (panel b) which is due to statistical errors. 

The Mgll emission line width comparison shows a significant difference between our 
estimates and those of Shen et al. (2008) (see Figure [TTl panels a to c). The larger scatter in 
Figure [TTk is due to a systematic underestimate of a in unreconstructed low-SNR spectra. 
Figure [TTJi shows the distribution of the ratio of our measured line dispersion to the FWHM 
measured by Shen et al. (2008) in the same objects, relative to the value of 1/2.3548 expected 
for a Gaussian. At fixed FWHM, most quasar Mgll emission lines have less flux in their 
wings than a Gaussian would. 

Figure \T2\ compares our log Mbh estimates with those of Shen et al. (2008). Figure [T^ 
illustrates the effect of the systematic underestimation of Mbh in unreconstructed low-SNR 
spectra (Figure [6^). However, even in PCA-reconstructed spectra (Figure [T2b) our BH mass 
estimates differ significantly from those of Shen et al. (2008). We note from Figure [6)d that 
there are no systematic errors in our mass measurements from reconstructed spectra as a 
function of SNR; we therefore have confidence in our measurements. Figure TCIb shows that 
the Shen et al. (2008) masses are overall somewhat larger than ours. Figure (T^b illustrates 
that relative to our masses from PCA-reconstructed spectra, the Shen et al. (2008) masses 
are, on average, overestimated at higher masses and underestimated at lower masses. 

The Mgll emission line width comparison shows a significant difference between our 
estimates and those of Shen et al. (2008) (see Figures [TTb to [TTb). However, we note from 
Figure [6)d that there is no systematic errors in our mass measurements as a function of SNR. 

8.2. Malmquist Bias and the true distribution of BH masses 

A Malmquist bias {6m', Eddington 1913; Malmquist 1922), i.e., the difference between 
the mean values of the distributions of the true BH masses and the observed BH masses, is 
estimated for each of our three scenarios. This Malmquist bias, which is a selection effect, 
exists since we are using a fiux-limited sample of quasars and since the mass function of 
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quasars is very steep. There are more quasars with small masses being scattered to large 
masses in our sample by random uncertainties than there are quasars with large masses being 
scattered to small masses in our sample, leading to overestimated BH masses on average. 

We use the generalized idea of the bias at fixed observed luminosity described in detail 
by Shen et al. (2008). This bias is estimated as 6m = — 7MC"xln(10)/(l + C2)^ where C2 is the 
slope of the assumed relationship between log-^BHtr«e (loS^)> where T = Lhoi/LEdd 
is the Eddington ratio and 7m is the slope when we assume a power-law distribution for the 
underlying true BH masses as A^(A^gjj) oc A^gjj'''*'^. The cxx is defined as the dispersion of 
the true distribution of the Eddington ratio (see Shen et al. 2008 for more details). Shen et al. 
has modeled the underlying distribution of the true BH masses and estimated C2 = 0.3 and 
7m = —2.6 for the redshift bin of0.7<z<1.0. We assume the same distribution of true BH 
masses and estimate the ctt for all three of our scenarios. We then estimate the Malmquist 
bias {6m) of the three scenarios to be 6M{bb) ~ 0.44, 6M{ob) ~ 0.40, and 6M{cia) ~ 0.11. 
However, we do not apply these estimated Malmquist biases on reported BH masses in the 
catalogue, as the calculation of the Malmquist bias is model dependent and users of the 
catalogue may wish to apply a differently calculated correction. The Malmquist bias for 
the case after applying both PCA and RPC is very small since the BH mass distribution 
is dominated by the narrow distribution of the luminosity due to the radiation pressure 
correction (see Equation [T2l) . 



9. Conclusions 

We have measured virial BH masses and radiation pressure adjusted virial BH masses 
for 27,602 quasars at 0.7 < z < 2.0 in the SDSS DR3 quasar catalogue, and have made our 
mass estimates available to the community. We have used the second moment of the Mgll 
emission line profile to estimate the line width and have calibrated a virial mass estimator 
using it and the monochromatic luminosity at SOOOA. The virial BH masses are typically in 
the range 10^ — 1O^'^A^0 for our quasar sample (Figures [S] and [H]). 

We have reconstructed the quasar spectra using Principal Component Analysis to in- 
crease the number of quasars for which reliable masses can be measured and to eliminate 
systematic biases due to the presence of noise. We have tested the reliability and biases 
of the mass measurements as a function of SNR before and after this PCA reconstruction. 
Our noise simulation shows no systematic bias in measured BH masses as a function of SNR 
(section [7]) . Using PCA reconstructed spectra for measuring BH masses reduces the intrinsic 
scatter in the BH mass estimates. 
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For a subsample of Shen et al. (2008) quasars in common with our sample, we have 
compared the estimated bolometric luminosity, the line width, and the BH mass. The 
bolometric luminosities are mutually consistent, with 0.1 dex statistical scatter. However, 
with respect to this study, Shen et al. (2008) have overestimated the line width in the case of 
the broadest emission lines and underestimated the line width in the case of the narrowest 
emission lines. This difference in line width measurement has affected the BH mass estimates 
such that those of the most massive BHs in Shen et al. are overestimated and those of the 
least massive BHs are underestimated with respect to this study. We beheve our black 
hole mass estimates are accurate. Improved understanding of black hole masses is necessary 
for understanding properties of the distribution of quasars in parameter space, such as the 
sub-Eddington boundary (Steinhardt & Elvis 2010; Rafiee & Hall 2011). 
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Fig. 1 . — The best linear fit to the reference sample in Table [T] using Markov Chain Monte- 
Carlo Simulation (MCMC; Haario et al. 2006) method with 33% intrinsic scatter. The gray 
areas correspond to: 50%, 90%, 95% and 99% posterior regions. 
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Fig. 2. — The top panel shows the relative change in quasar redshift after cross correlating 
the median quasar composite spectrum (Vanden Berk et al. 2001) with SDSS quasar spectra 
versus the SDSS redshift estimates described in §4.10.2.1 of Stoughton et al. (2002). The 
colormap is based on the estimated redshift confidence from the SDSS catalogue (Stoughton 
et al. 2002). The bottom panel shows the distribution of the relative change in quasar 
redshift. See the online version for color Figures. 
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Fig. 3. — A sample SDSS quasar spectrum, before and after reconstruction using PCA, 
and our best pseudo-continuum estimates for it. The top panel shows the observed SDSS 
spectrum (black) and continuum (green) and the best pseudo-continuum fit (red). The 
middle panel shows the PCA reconstructed spectrum (blue) and continuum (green) and 
the best pseudo-continuum fit (red). The bottom panel shows the reconstructed spectrum 
in cyan after subtracting the best pseudo-continuum. The dotted red vertical lines show 
the normalization windows where we initially estimate the normalization of the power law, 
2238 - 2248 A and 3014 - 3027 A. The dotted blue lines show two emission lines Mgll and 
[Ne iv] . The dot-dashed lines determine the ranges within which we calculate the of the 
pseudo-continuum fit. The black dashed lines are the borders of the fitting region. Narrow 
absorption lines and noise spikes within the Fe ll fitting region have been removed through 
PCA reconstruction. See the online version for color Figures. 
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Fig. 4. — A typical quasar spectrum with noise added to produce spectra with different 
SNR levels (cyan). The spectrum reconstructed in each case using the 50 most significant 
PCA eigenspectra from Yip et al. (2004) is overplotted. (a) Reconstructed spectrum shown 
in black generated from spectrum with original SNR in cyan, (b) Reconstructed spectrum 
in blue generated from spectrum with 1/2 original SNR. (c) Reconstructed spectrum in 
red generated from spectrum with 1/4 original SNR. (d) Reconstructed spectrum in green 
generated from spectrum with 1/8 original SNR. (e) Reconstructed spectrum in magenta 
from spectrum with 1/16 original SNR. (f) over-plotting the 5 reconstructed spectra in 
panel (a) to (e). See the online version for color Figures. 
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Fig. 5. — The distribution of the xl comparing the reconstructed spectra using PCA tech- 
nique with raw spectra for two cases: sohd curve compares within the window 2700 A< 
A < 2900 A, dash-hne curve compares the whole spectrum. See the onhne version for color 
Figures. 
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Fig. 6. — The log difference of the BH mass estimates for four scenarios. Different objects 
are represented in different colors. In all Figures, subscript a stands for after and subscript h 
stands for before. The first subscript states the PCA status while the second subscript states 
the Radiation Pressure Correction (RFC) status. Masses are estimated (a) before applying 
FCA and before RFC; (b) after applying FCA and before RFC; (c) before applying FCA 
but after RFC; (d) after applying FCA and after RFC. The cutoff on lower mass estimates 
in panel (c) is due to the asymptotic limit when 0. The presence of the light blue 

points at x-axis ~ —0.7 in this panel are due to the presence of the broad absorption line 
(BAL) in the Mgll window of an object. See the online version for color Figures. 
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Fig. 7. — The distribution of the log difference of the BH mass estimates for two SNR bins: 
(a) and (c) 1.5 < SNRj < 2.4; (b) and (d) 18 < SNRj < 23. All plots are after applying 
PCA and for two scenarios: (a) and (b) before RPC; (c) and (d) after RFC The mean (/x) 
and the standard deviation (a) of each distribution are reported in each panel. Comparing 
Figure (a) to (b) or comparing Figure (c) to (d) shows a factor of 3-3.7 improvement in mass 
estimates when high SNR is used. Comparing Figure (a) to (c) or comparing Figure (b) to 
(d) we have a factor of 2-2.5 improvement due using RPC. 
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Fig. 8. — The log A^bh versus redshift plane for three scenarios, (a) Before applying PCA 
and RFC. (b) After applying FCA and before RFC. (c) After applying FCA and RFC. The 
lack of low mass BHs at high z and high mass BHs at low z is due to detection and saturation 
limits. BH masses between 10^'^ and 10^'^ are distributed all over our redshift range. See 
the online version for color Figures. 
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Fig. 9. — Contours of log A^bh solar units versus augii plane for three scenarios, (a) 
Before applying PCA and RFC (b) After applying FCA and before RFC. (c) After applying 
FCA and RFC. The black curves show the log A^gj^ versus aMgii for a fixed log Lj^oi ranging 
from 44 to 47. The red line log A^gfj ~ represents the upper mass limit suggested by 
Shen et al. (2008). See the online version for color Figures. 



- 31 - 





Fig. 10. — LogLboi{Shen) versus log Lboi{RSzH) for two scenarios, (a) Before applying PCA 
and RFC (c) After applying FCA and before RFC. Lf, and La are Luminosities before and 
after applying FCA (but before applying RFC) respectively. Both panels show consistency 
between Shen et al. (2008) and this study; in both, there is a 0.1 dex scatter on the 
distribution (panel b) due to statistical errors. See the online version for color Figures. 
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Fig. 11. — Log(FWHM5'/ien/2.3548) versus logaR^n for three scenarios, (a) Before applying 
PCA and RFC. (b) After applying FCA and before RFC. (c) After applying FCA and RFC. 
Although cTfl (the line dispersion after applying FCA) is the same in both (b) and (c) they 
generate different A^gjj distributions since the scaling relationship is different, at is the 
line dispersion before applying FCA. Fanel (d) shows the distribution of the ratio of our 
measured line dispersion to the FWHM measured by Shen et al. (2008) in the same objects, 
relative to the value of 1/2.3548 expected for a Gaussian. See the online version for color 
Figures. 
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Fig. 12. — Log^A-Q^{Shen) versus log A^gjj(i?&if) in solar units for two scenarios, (a) 
Before applying PCA and RFC (c) After applying FCA and before RFC. Fanel (b) shows 
that the Shen et al. (2008) masses are overall somewhat larger than ours; Mfyj, and Mat 
are our A^gpj values computed before and after applying FCA (but before applying RFC) 
respectively. The steeper slope in panel (c) is due to the same effect on augii measurement 
as in Figure [HI meaning that relative to our masses, the high mass BHs are overestimated 
in Shen et al. (2008) while the low mass BHs are underestimated. See the online version for 
color Figures. 



Table 1. Calibration Data. 



Object 
1 



"Mgll AL3000/IO 
km/s erg/s 
3 4 



days 
5 



Mbh 
<f0 X 10 



1 Virial M^ff 
M0 X 10^ 



flag 2 Adjusted Virial M^H 

M,0 X 10'' 
9 " 10 



"bulge 
km/s 
11 



3C390.3 



Aknl20(Arkl20) t 
Fairallg 



LWP04500 
LWR16609 

LWP 16776 
LWP03808 
LWP17245 
LWP 18382 
LWP 17269 
LWP17478 

Y29E0106T 
LWP24094 
LWP24095 
LWP06492 
LWP 19270 
LWP 19664 
LWP21476 
LWP21474 
LWP21473 
LWP19916 



3810 tll fo 56.6 



19533.8 
1377.7 



0.5397 
0.0417 



23.60 j:*-^^ 
+4.16 
-4.95 



39.05 



17 40 ™ 
J-f.'tu _4 3Q 



+ 31.4 
-22.5 



287.0 

150.0 



255.0 



87.0 ± 3.7 



222 ± 10 



162.0 ± 24.0 



-1 " 1610 ± 220 
1 2.2 ± 4.0 



3870 ± 540 
6.7 ± 11 



240.0 ± 36.0 
239.0 ± 36.0 



CO 



34.4 ± 0.8 



109.6 ± 3.0 



MrkllO 



LWP12760 
LWP12761 



24.37 



+6.1 
-6.1 



9.0 ± 1.2 



26.3 ± 4.0 



86.0 ± 13.0 



LWR11470 
LWR03512 
LWR09917 
LWR11947 
LWR14554 
LWR14555 
LWP 19466 
LWP19123 
LWP16819 



+3.88 
-3.34 



+ 3.7 
-3.7 



76.3 ± 4.1 



LWR1371fi 
LWR01636 
LWP14218 
LWR01309 
LWP 14634 
LWP 10829 
LWP15474 
LWR06219 
LWP18844 
LWP 18786 
LWP 18786 



+6.10 
-5.40 



+ 12.0 
-12.0 



45.6 ± 0.8 



142.0 ± 3.2 



Table 1 — Continued 



Object 


DatalD 




Ai^aooo/lO^'* 






flag 1 


Virial M^jf 


flag 2 


Adjusted Virial Mbh 


""bulge 






km/s 


erg/s 


days 


Mq X 10^ 




Mq X lO'' 




Mq X 10^ 


km / s 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 



MrkB90 



LWP19B77 
LWR13721 



+ 2.16 
-2.01 



+ 7.4 
-7.4 



26.6 ± 2.6 



67.5 ± 7.1 



194.0 ± 20.0 



Mrk817 
NGC3783 



LWR06141 
LWR01320 



LWR11936 
LWP23003 
LWP23318 
LWP23311 
LWP23298 
LWP23289 
LWP23280 
LWP23094 
LWP23075 
LWP23073 
LWP23035 
LWP22902 
LWP22876 
LWP22845 
LWP22827 
LWP22826 
LWP23074 



0.4975 
1.0361 



+ 4.02 
-3.80 



19.06 



+ 3.30 
-2.30 



52.4 
49.4 



+ 14.4 
-14.4 



+ 7.7 
-7.7 



+ 5.4 
-5.4 



10.7 ± 1.6 1 
24.7 ± 1.8 1 



4.3 ± 1.2 



29.5 ± 5.0 130.0 ± 20.0 

67.5 ± 5.5 142.0 ± 21.0 



12.5 ± 4.1 



00 



95.0 ± 10.0 



NGC6648 



NGC7469 t 
PG0844+349 



LWP23153 
LWP19265 
LWP27298 
LWP27297 
LWP24347 
LWPlllOO 

1131.4 

LWP24683 
LWP25a88 
LWP25462 
LWP25440 
LWP25409 
LWP25464 
LWP25422 
LWP25483 
LWP25522 
LWP25514 
LWP25496 

1971.3 

Y3B4010CT 1798.8 
LWP12206 1047.1 



0.3769 

0.5957 
4.6751 



, +2.60 
-1.80 



4.54 
31.99 



+ 0.97 
-0.9S 
+ 0.64 
-0.68 
+ 13.45 
-12.64 



+0.78 
-0.78 



67 1 +^-'^ 
^'■^ -2.6 

12 2 

'-^■^ -1.4 
92.4 



0.7 ± 6.2 



13.7 ± 2.7 

14.4 ± 1.8 
13.6 ± 0.2 



1.6 ± 16.0 



35.9 ± 7.7 

39.2 ± 5.5 
69.9 ± 1.7 



84.0 ± 9.0 



183.0 ± 27.0 
162.0 ± 16.0 
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Table 1 — Continued 



Object 


DatalD 




-^-^3000/10^^ 


< T >Ha,/3,1 


Mbh 


flag 1 


Virial Mbh 


flag 2 Adjusted Virial Mbh 








km/s 


erg/s 


days 


Mq X 10® 




Mq X 10® 




Mq X 10® 


km/s 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


PG1211+143 


LWP13628 

LWP07603 
LWP07223 
LWP23329 
LWP19386 




















PG1226+023 + 




958.4 


8.8465 


103.94 tllil 


146.0 


1 


15.7 ± 0.1 


-1 


108.0 ± 1.5 




Y0G4()2()FT 


1816.2 


85.9274 


352.99 tf^ ll 


886.0 


1 


175.8 ± 0.2 


-1 


1104 lb 1.6 




PG1229+204 


LWR13136 
LWR16071 
























1568.2 


3.1430 


33.33 ^12. 35 


700 +35.2 
'■5-^ -35.2 


-1 


24.8 ± 0.6 


1 


84.4 ± 2.6 


162.0 ± 32.0 * 


PG1351+640 t 


Y0P8030BT 
O6B616010 
























1995.4 


6.9307 






-1 


60.3 ± 0.7 


-1 


199.7 ± 2.8 




PG1411+442 1" 


Y11U()103T 


1700.3 


3.9038 


101.58 ±^1;™ 


443 +l*<i <' 


1 


32.8 ± 0.6 


-1 


109.8 ± 2.7 




PG2130+099 


LWR04610 
LWP03668 
























1478.9 


3.9699 


177 22 +19-9<1 


457 +^^-° 
^OI.KJ _55.0 


1 


25.1 ± 0.5 


1 


91.7 ± 2.3 


172.0 ± 46.0 1 


Note. — Column 1 details the name of the objects and its 


resources. Column 


2 are the lUE or 


HST spectra. 


. Columns 3 and 4 arc 


the Mg II line dispersion in un 


its of km s ^ and 


the monochromati 


ic luminosity at 


3000A respectively. Column 5 is the average 


time-lag in units of light-days from Peterson 


et al. 


. 2004. Column 6 is the reverberation mapping 


BH mass estimate 


is from Peterson et al. 2004 


. Columns 8 


and 10 are the BH 


mass estimates for two scenarios, before or 


after 


applying the radiation correction, respectively. 



Columns 7 and 9 are the outlier flag in regression analysis; flag 1 for Virial M^h ^■i^d ^a-g 2 for Adjusted Virial M^h- Column 11 is the stellar velocity dispersion of the bulge 
from Onken et al. 2004 and Dasyra et al. 2007. 

^HST data. 

^ Bulge Velocity dispersion extracted from Dasyra et al. 2007. 
"^Double peak line. 
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Table 2. log(r) = a log(£) + /3, Fitting Results. 



Fitting Method 


Slope (a) 


Intercept 


Scatter 


FITEXY 


0.55 ± 0.02 


1.49 ± 0.02 





FITEXY 


0.47 ± 0.12 


1.46 ± 0.09 


0.35 


BCES(Y|X) 


0.48 ± 0.16 


1.46 ± 0.10 





bootstrap 


0.51 ± 0.20 


1.43 ± 0.10 





BCES Bisector 


0.63 ± 0.11 


1.44 ± 0.10 





bootstrap 


0.66 ± 0.18 


1.43 ± 0.11 





MCMC t 


0.40 ± 0.10 


1.46 ± 0.08 


0.33 



^Markov Chain Monte- Carlo Simulation 
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Table 3. Catalogue Format. 



Col 


Format 


Units 


Label 


Description 


1 


A18 




Name 


SDSS J 


2 


F11.6 


deg 


RAdeg 


Right Ascension in decimal degrees (J2000) 


3 


F11.6 


deg 


DEdeg 


Declination in decimal degrees (J2000) 


4 


F7.5 




z 


redshift 


5 


F7.4 




imag 


PSF i-band apparent magnitude 


6 


F7.4 




iMag 


PSF i-band absolute magnitude, K-corrected to z=2 


7 


16 


d 


MJD 


Modified Julian Date 


8 


15 




Plate 


Plate number 


9 


15 




Fiber 


Fiber identification number 


10 


F7.4 


A 


si 


Instrumental sigma 


Before applying PCA(2) and RPC{2) 


11 


F7.4 


A 


sim_b 


o'Mgll estimates from the second moments 


12 


F7.4 


A 


st_b 


true o-j^jgji in A 


13 


F9.6 


erg cm~^ A~^ 


F3.b 


Fa at 3000 A 


14 


F7.4 


erg 


LL.b 


Monochromatic luminosity at 3000A, XLx in units 1E44 (4) 


15 


F8.4 


light-days 


RJd.b 


Radius of the BLR 


16 


F8.3 




SN.G.b 


Median SNR in G band 


17 


F8.3 




SN.R.b 


Median SNR in R band 


18 


F8.3 




SNJ.b 


Median SNR in I band 


19 


F7.4 


Mq 


MBH.bb 


BH mass in solar units 


20 


F7.4 


Mq 


Err.MBH.bb 


Error in BH mass 


After applying PCA(2) but before RPC(2) 


21 


F7.4 


A 


sim_a 


CMgll estimates from the second moments(3) 


22 


F7.4 


A 


st_a 


true UMgll in A (3) 


23 


F9.6 


erg cm~^ 


F3.a 


Fx at 3000 A (3) 


24 


F7.4 


erg s~^ 


LL_a 


Monochromatic luminosity at 3000A, \L\ in units of 1E44 (3) (4) 


25 


F8.4 


light-days 


RJd_a 


Radius of the BLR(3) 


26 


F8.3 




SN.G-a 


Median SNR in G band 


27 


F8.3 




SN.R.a 


Median SNR in R band 


28 


F8.3 




SNJ.a 


Median SNR in I band 


29 


F7.4 


Mq 


MBH^b 


BH mass in solar units (3) 


30 


F7.4 


Mq 


Err.MBH.ab 


Error in BH mass (3) 


After applying PCA(2) and RPC(2) 


31 


F7.4 


Mq 


MBH_aa 


BH mass in solar units 


32 


F7.4 


Mq 


Err_MBH_aa 


Error in BH mass 


33 


F7.4 




zJlH 


redshift re-estimated by this study 


34 


11 




DR3c 


In DR3 complete subsample? (0=no, l=yes) 



Note. — (1) Entries reading 0.0000 or 0.0000e-|-00 indicate that the quantity was not measurable. (2) PGA stands for 
Principal Component Analysis and RPC stands for Radiation Pressure Correction. (3) Recommended results. (4) Bolometric 
Luminosity can be estimated using BC(3000A) = 5.15 and the monochromatic luminosity at 3000A (Shen et al. 2008). Until 
the catalogue is available through the ApJS website it can be downloaded as |http: / /ara.phys.yorku.ca/rafieehall2011.txt) 



